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Abstract 

An instance of the maximum constraint satisfaction problem (Max CSP) is a finite 
collection of constraints on a set of variables, and the goal is to assign values to the 
variables that maximises the number of satisfied constraints. Max CSP captures many 
well-known problems (such as Max fc-SAT and Max Cut) and is consequently NP- 
hard. Thus, it is natural to study how restrictions on the allowed constraint types (or 
constraint language) affect the complexity and approximability of Max CSP. The PGP 
theorem is equivalent to the existence of a constraint language for which Max CSP has 
a hard gap at location 1, i.e. it is NP-hard to distinguish between satisfiable instances 
and instances where at most some constant fraction of the constraints are satisfiable. All 
constraint languages, for which the CSP problem (i.e., the problem of deciding whether 
all constraints can be satisfied) is currently known to be NP-hard, have a certain algebraic 
property. We prove that any constraint language with this algebraic property makes Max 
CSP have a hard gap at location 1 which, in particular, implies that such problems cannot 
have a PTAS unless P = NP. We then apply this result to Max CSP restricted to a 
single constraint type; this class of problems contains, for instance. Max Cut and Max 
DiCuT. Assuming P / NP, we show that such problems do not admit PTAS except in 
some trivial cases. Our results hold even if the number of occurrences of each variable 
is bounded by a constant. We use these results to partially answer open questions and 
strengthen results by Engebretsen et al. [Theor. Comput. Sci., 312 (2004), pp. 17- 
45], Feder et al. [Discrete Math., 307 (2007), pp. 386-392], Krokhin and Larose ]Proc. 
Principles and Practice of Constraint Programming (2005), pp. 388-402], and Jonsson 
and Krokhin [J. Comput. System Sci., 73 (2007), pp. 691-702]. 
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1 Introduction 



Many combinatorial optimisation problems are NP-hard so there has been a great interest 
in constructing approximation algorithms for such problems. For some optimisation prob- 
lems, there exist powerful approximation algorithms known as polynomial-time approximation 
schemes (PTAS). An optimisation problem 11 has a PTAS A if, for any fixed rational c > 1 and 
for any instance 2 of 11, A{J,c) returns a c-approximate (i.e., within c of optimum) solution 
in time polynomial in \2\. There are some well-known NP-hard optimisation problems that 
have the highly desirable property of admitting a PTAS: examples include Knapsack [32], 
Euclidean Tsp [2], and Independent Set restricted to planar graphs [6l [45]. It is also 
well-known that a large number of optimisation problems do not admit PTAS unless some 
unexpected collapse of complexity classes occurs. For instance, problems like Max fc-SAT [1] 
and Independent Set [5] do not admit a PTAS unless P = NP. We note that if 11 is a 
problem that does not admit a PTAS, then there exists a constant c > 1 such that 11 cannot 
be approximated within c in polynomial time. 

The constraint satisfaction problem (CSP) [52] and its optimisation variants have played 
an important role in research on approximability. For example, it is well known that the 
famous PCP theorem has an equivalent reformulation in terms of inapproximability of some 
CSP [UimiSl], and the recent combinatorial proof of this theorem [25] deals entirely with 
CSPs. Other important examples include Hastad's first optimal inapproximability results [31] 
and the work around the unique games conjecture of Khot [iSiiiO, 40j. 

We will focus on a class of optimisation problems known as the maximum constraint satis- 
faction problem (Max CSP). The most well-known examples in this class probably are Max 
fc-SAT and Max Cut. 

We are now ready to formally define our problem. Let D he a. finite set. A subset R C £)" 

(k) 

is a relation and n is the arity of R. Let R}j be the set of all fc-ary relations on D and let 
Rd — ^^iZiR^D ■ ^ constraint language is a finite subset of Rd- 

Definition 1.1 (CSP(r)) The constraint satisfaction problem over the constraint language 
T, denoted CSP(r), is defined to be the decision problem with instance {V,C), where 

• V is a set of variables, and 

• C is a collection of constraints {Ci, . . . , Cg}, in which each constraint Ci is a pair {Ri, Si) 
with Si a list of variables of length Ui, called the constraint scope, and Ri is an ni-ary 
relation in Rd, called the constraint relation. 

The question is whether there exists an assignment s : V ^ D which satisfies all con- 
straints in C or not. A constraint {Ri, (vi-^jVi^, . . . ,Vi^,)) € C is satisfied by an assign- 
ment s if the image of the constraint scope is a member of the constraint relation, i.e., if 
(s(u,J,s(w,J,...,s(wj„.)) Gi?,. 

Many combinatorial problems are subsumed by the CSP framework; examples include 
problems in graph theory |30], combinatorial optimisation [38], and computational learn- 
ing [22]. We refer the reader to [TT] for an introduction to this framework. 

For a constraint language T C Rjj, the optimisation problem Max CSP(r) is defined as 
follows: 

Definition 1.2 (Max CSP(r)) Max CSP(r) is defined to be the optimisation problem with 
Instance: An instance {V,C) o/CSP(r). 
Solution: An assignment s : V ^ D to the variables. 
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Measure: Number of constraints in C satisfied by the assignment s. 



We use collections of constraints instead of just sets of constraints as we do not have any 
weights in our definition of Max CSP. Some of our reductions will make use of copies of one 
constraint to simulate something which resembles weights. We choose to use collections instead 
of weights because bounded occurrence restrictions are easier to explain in the collection 
setting. Note that we prove our hardness results in this restricted setting without weights and 
with a constant bound on the number of occurrences of each variable. 

Throughout this report, Max CSP(r)-fc will denote the problem Max CSP(r) restricted 
to instances with the number of occurrences of each variable is bounded by k. For our hardness 
results we will write that Max CSP(r)-i3 is hard (in some sense) to denote that there is a A: 
such that Max CSP(r)-A: is hard in this sense. If a variable occurs t times in a constraint which 
appears s times in an instance, then this would contribute t ■ s to the number of occurrences 
of that variable in the instance. 

Example: Given a (multi)graph G = {V,E), the Max k-CvT problem, /c > 2, is the 
problem of maximising \E'\, E' C E, such that the subgraph G' = {V, E') is fc-colourable. For 
k — 2, this problem is known simply as Max Cut. The problem Max /c-Cut is known to be 
APX-complete for any k (it is Problem GT33 in [6]), and so has no PTAS. Let Nk denote 
the binary disequality relation on {0, 1, . . . , fc — 1}, k > 2, that is, (x, y) G Nk <=> x ^ y. 
To see that Max CSP({A^fe}) is precisely Max fc-CuT, think of vertices of a given graph as 
of variables, and apply the relation to every pair of variables x,y such that {x,y) is an edge 
in the graph, with the corresponding multipHcity. 

Most of the early results on the complexity and approximability of CSP and Max CSP 
were restricted to the Boolean case, i.e. when D = {0, 1}. For instance, Schaefer [53j char- 
acterised the complexity of CSP(r) for all T over the Boolean domain, the approximability 
of Max CSP(r) for all T over the Boolean domain have also been determined [iHl [201 138] . 
It has been noted that the study of non-Boolean CSP seems to give a better understanding 
(when compared with Boolean CSP) of what makes CSP easy or hard: it appears that many 
observations made on Boolean CSP are special cases of more general phenomena. Recently, 
there has been some major progress in the understanding of non-Boolean CSP: Bulatov has 
provided a complete complexity classification of the CSP problem over a three-element domain 
[9] and also given a classification of constraint languages that contain all unary relations [7]. 
Corresponding results for Max CSP have been obtained by Jonsson et al. [36] and Deineko et 
al. t23j. 

We continue this line of research by studying two aspects of non-Boolean Max CSP. The 
complexity of CSP(r) is not known for all constraint languages F — it is in fact a major open 
question [121 IlHI- However, the picture is not completely unknown since the complexity of 
CSP(r) has been settled for many constraint languages [51 fTOl fT2l fTSl [33 l [33] . 

It has been conjectured [28j that for all constraint languages F, CSP(F) is either in P or is 
NP-complete, and the refined conjecture [12j (which we refer to as the "CSP Conjecture", see 
Section [331 for details) also describes the dividing line between the two cases. Recall that if P 
^ NP, then Ladner's Theorem [43j states that there are problems of intermediate complexity, 
i.e., there are problems that are not in P and not NP-complete. Hence, we cannot rule out a 
priori if there is a constraint language F such that CSP(F) is neither in P nor NP-complete. 
If the CSP Conjecture is true, then the family of constraint languages which are currently 
known to make CSP(F) NP-complete consists of all constraint languages with this property. 

In the first part of the report we study the family of all constraint languages F such that it 
is currently known that CSP(F) is NP-hard. We prove that each constraint language in this 
family makes Max CSP(F) have a hard gap at location 1. "Hard gap at location 1" means that 
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it is NP-hard to distinguish instances of Max CSP(r) in which all constraints are satisfiable 
from instances where at most an e-fraction of the constraints are satisfiable (for some constant 
e which depends on F)^. It is immediate that having a hard gap at location 1 excludes the 
existence of a PTAS for the problem. The result is proved in Section [3] (Theorem I3.6p and 
can be stated as follows (we refer the reader to Section [3] for an introduction to the algebraic 
terminology) . 

Result A (Hardness at gap location 1 for Max CSP): Let F be a core constraint language 
and let A be the algebra associated with F. If A'^ has a factor which only has projections as 
term operations, then Max CSP(F) has a hard gap at location 1. The result holds even if we 
have a constant bound on the number of variable occurrences. 

A similar result holds when the problem is restricted to satisfiable instances only (Corol- 
lary [3lT4l). We note that for the Boolean domain and without the bounded occurrence restric- 
tion, Result A follows from a result of Khanna et al. [38l Theorem 5.14]. 

Interestingly, the PCP theorem is equivalent to the fact that, for some constraint language 
F over some finite set D, Max CSP(F) has a hard gap at location 1 [lESllSg. Clearly, 
CSP(F) cannot be polynomial time solvable in this case. For any constraint language F 
satisfying the condition from Result A, the problem CSP(F) is known to be NP-complete, 
and it has been conjectured |12] that CSP(F) is in P for all other (core) constraint languages 
(see Section [3^ . Thus, Result A states that Max CSP(F) has a hard gap at location 1 
for any constraint language such that CSP(F) is known to be NP-complete. Moreover, if the 
above mentioned conjecture holds, then Max CSP(F) has a hard gap at location 1 whenever 
CSP(F) is not in P. Another equivalent reformulation of the PCP theorem states that the 
problem Max 3- SAT has a hard gap at location 1 [11 [55], and our proof of consists of a gap 
preserving reduction from this problem. 

We also show how Result A can be used for partially answering two open questions. The 
first one was posed by Engebretsen et al. L26j and concerns the approximability of a finite 
group problem while the second was posed by Feder et al. and concerns the hardness of 
CSP(F) with the restriction that each variable occurs at most a constant number of times, 
under the assumption that CSP(F) is NP-complete. 

The techniques we use to prove Result A are partly from universal algebra — such methods 
have previously proved to be very useful when studying the complexity of CSP [9l [TO t fT2 t [13 1 
[33l [34] . However, they have not previously been used to prove hardness results for Max 
CSP. Typically, the algebraic combinatorial property of supermodularity and the technique 
of strict implementations (see Section 14.31 and Section 12.21 respectively, for the definitions) 
have been used when proving results of this kind. This is, for instance, the case in [231 ISS] 
where it is proved that for any constraint language F over D such that F includes the set 
Cd = {{(2;)} I X G D} or D has at most three elements. Max CSP(F) is either solvable (to 
optimality) in polynomial time or else APX-hard (in which case it cannot have a PTAS unless 
P = NP). 

The second aspect of Max CSP we study is the case when the constraint language consists 
of a single relation; this class of problems contains some of the most well-studied examples of 
Max CSP such as Max Cut and Max DiCut. Before we state this result we need to define 
some terminology. For a relation R we shall say that R is d-valid if (d, . . . , d) £ i? for d £ D 
and simply valid if R is d-valid for some d £ D. Informally, our main result on this problem 
is (see Theorem 14. II for the formal statement): 

^Some authors consider the promise problem GAP-CSP[e,l] where an instance is a Max CSP instance 
(V, C) and the problem is to decide between the following two possibilities: the instance is satisfiable, or at 
most £ ■ \C\ constraints are simultaneously satisfiable. Obviously, if a Max CSP(r) has a hard gap at location 
1, then there exists an e such that the corresponding GAP-CSP[e, 1] problem is NP-hard. 
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Result B (Approximability of single relation Max CSP): Let i? be a relation in Rd- 
If R is empty or valid, then Max CSP({i?}) is trivial. Otherwise, there exists a constant c 
(which depends on R) such that it is NP-hard to approximate Max CSP({-R}) within c. The 
result holds even if we have a constant bound on the number of variable occurrences. 

Jonsson and Krokhin [37] have proved that every problem Max CSP({i?}) with R neither 
empty nor vaHd is NP-hard. We strengthen their theorem by proving Result B; to do so, we 
make use of Result A. Note that for some Max CSP problems such approximation hardness 
results are known, e.g.. Max Cut and Max DiCut. Our result extends those hardness 
results to all possible relations. As a corollary to this result we get that if Max CSP({i?}) is 
NP-hard, then there is no PTAS for Max CSP({-R}) (assuming P NP). Note that a full 
complexity classification of single-relation CSP is not known. In fact, Feder and Vardi [28] 
have proved that by providing such a classification, one has also classified the CSP problem 
for all constraint languages. 

In Section [4731 we strengthen two earlier published results on Max CSP in various ways 
— the common theme is that Result B is used to obtain the results. The reader is referred 
to Section [HH] for definitions of the relevant concepts used below to describe the results. We 
prove that unless P = NP, constraint languages which contain all at most binary 2-monotone 
relations on a partially ordered set which is not a lattice order give rise to a Max CSP problem 
which is hard to approximate. The other result states that constraint languages which contain 
all at most binary 2-monotone relations on a lattice and is not supermodular on the lattice 
make Max CSP hard to approximate. These two problems have previously been studied by 
Krokhin and Larose [lH |l2] . 

Here is an overview of the report: In Section[2]we define some concepts we need. Section[3] 
contains the proof for our first result and Section [4] contains the proof of our second result. In 
Section [4731 we strengthen some earlier published results on Max CSP as mentioned above. 
We give a few concluding remarks in Section [5l 

2 Preliminaries 

A combinatorial optimisation problem is defined over a set of instances (admissible input data); 
each instance I has a set sol(Z) of feasible solutions associated with it, and each solution 
y G sol(I) has a value m(I, y). The objective is, given an instance I, to find a feasible solution 
of optimum value. The optimal value is the largest one for maximisation problems and the 
smallest one for minimisation problems. A combinatorial optimisation problem is said to 
be an NP optimisation (NPO) problem if its instances and solutions can be recognised in 
polynomial time, the solutions are polynomially-bounded in the input size, and the objective 
function can be computed in polynomial time (see, e.g., 0). 

Definition 2.1 (Performance ratio) A solution s G sol(I) to an instance I of an NPO 
problem 11 is r-approximate if 



where OPt(I) is the optimal value for a solution to X. An approximation algorithm for an 
NPO problem H has performance ratio R{n) if, given any instance X of II with \X\ — n, it 
outputs an Rin) -approximate solution. 

PO is the class of NPO problems that can be solved (to optimality) in polynomial time. 
An NPO problem 11 is in the class APX if there is a polynomial time approximation algorithm 
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for n whose performance ratio is bounded by a constant. The following result is well-known 
(see, e.g., |16[ Proposition 2.3]). 

Lemma 2.2 Let D he a finite set. For every constraint language T C Rd, Max CSP(r) 
belongs to APX. Moreover, if a is the maximum arity of any relation in T, then there is a 
polynomial time algorithm which, for every instance X = (V, C) 0/ Max CSP(r), produces a 
solution satisfying at least constraints. 

Definition 2.3 (Hard to approximate) We say that a problem H is hard to approximate 
if there exists a constant c such that, 11 is NP-hard to approximate within c (that is, the 
existence of a polynomial-time approximation algorithm for H with performance ratio c implies 
P =NP). 

The following notion has been defined in a more general setting by Petrank [50] . 

Definition 2.4 (Hard gap at location a) MAX CSP(r) has a hard gap at location a < 1 
if there exists a constants < a and a polynomial-time reduction from an NP -complete problem 
n to Max CSP(r) such that, 

• Yes instances of II are mapped to instances T = (V, C) such that OPT {I) > a\C\, and 

• No instances ofH are mapped to instances 2 = {V,C) such that OPT(2r) < s\C\. 

Note that if a problem 11 has a hard gap at location a (for any a) then 11 is hard to 
approximate. This simple observation has been used to prove inapproximability results for a 
large number of optimisation problems. See, e.g., jSj [HI [55] for surveys on inapproximability 
results and the related PCP theory. 

2.1 Approximation Preserving Reductions 

To prove our approximation hardness results we use AP -reductions. This type of reduction 
is most commonly used to define completeness for certain classes of optimisation problems 
(i.e., APX). However, no APX-hardness results are actually proven in this report since we 
concentrate on proving that problems are hard to approximate (in the sense of Definition [231) • 
We will frequently use ^P-reductions anyway and this is justified by Lemma ITgI below. Our 
definition of ^F-reductions follows [20, 38j. 

Definition 2.5 (AP-reduction) Given two NPO problems Hi and II2 an AP-reduction 
from Hi to II2 is a triple {F, G, a) such that, 

• F and G are polynomial-time computable functions and a is a constant; 

• for any instance X of Hi, F{X) is an instance 0/112; 

• for any instance X 0/ IIi, and any feasible solution s' of F{X), G{X,s') is a feasible 
solution ofX; 

• for any instance X of Hi, and any r > 1, if s' is an r-approximate solution of F{X) then 
G{X,s') is an (1 + (r — l)a + o{l)) -approximate solution of X where the o-notation is 
with respect to \X\. 

If such a triple exist we say that Hi is AP -reducible to H2. We use the notation Hi <ap H2 
to denote this fact. 
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It is a well-known fact (see, e.g., Section 8.2.1 in [6]) that v4P-reductions compose. The 
following simple lemma makes v4P-reductions useful to us. 

Lemma 2.6 If Hi <ap ^2 and Hi is hard to approximate, then II2 is hard to approximate. 

Proof: Let c > 1 be the constant such that it is NP-hard to approximate Hi within c. Let 
(F, G, a) be the v4P- reduction which reduces Hi to 112. We will prove that it is NP-hard to 
approximate 112 within 

r = - c - 1) + 1 - e' 
a 

for any e' > 0. 

Let li be an instance of Hi. Then, X2 = F{Ii) is an instance of 112. Given an r-approximate 
solution to X2 we can construct an (1 + [r — \)a + o(l))-approximate solution to Ii using G. 
Hence, we get an 1 + (r — l)a + o(l) = c — ae' + o(l) approximate solution to Ti, and when 
the instances are large enough this is strictly smaller than c. As c > 1 we can choose e' such 
that e' > and c- ae' > 1. □ 



2.2 Reduction Techniques 

The basic reduction technique in our approximation hardness proofs is based on strict im- 
plementations and perfect implementations. Those techniques have been used before when 
studying Max CSP and other CSP-related problems [20, 36, 38J. 

Definition 2.7 (Implementation) A collection of constraints Ci,...,C„i over a tuple of 
variables x = {xi, . . . , Xp) called primary variables and y = (yi, . . . , yq) called auxiliary vari- 
ables is an a-implementation of the p-ary relation R for a positive integer a if the following 
conditions are satisfied: 

1. For any assignment to x and y, at most a constraints from Ci, . . . , Cm are satisfied. 

2. For any x such that x ^ R, there exists an assignment to y such that exactly a constraints 
are satisfied. 

3. For any x, y such that x ^ R, at most (a — 1) constraints are satisfied. 

Definition 2.8 (Strict /Perfect Implementation) An a -implementation is a strict imple- 
mentation if for every x such that x ^ R there exists y such that exactly (a — 1) constraints are 
satisfied. An a-implementation (not necessarily strict) is a perfect implementation if a = m. 

It will sometimes be convenient for us to view relations as predicates instead. In this case an 
n-ary relation R over the domain D is a. function r : Z?" {0, 1} such that r{x) — 1 <;=^ x G 
R. Most of the time we will use predicates when we are dealing with strict implementations and 
relations when we are working with perfect implementations, because perfect implementations 
are naturally written as a conjunction of constraints whereas strict implementations may 
naturally be seen as a sum of predicates. We will write strict a- implementations in the 
following form 

m 

g{x) + (a - 1) = max V'gi(a;i) 

where x = (xi, . . . , Xp) are the primary variables, y — (yi, . . . , y^) are the auxiliary variables, 
g{x) is the predicate which is implemented, and each Xi is a tuple of variables from x and y. 
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We say that a collection of relations F strictly (perfectly) implements a relation R if, for 
some a G Z"*" , there exists a strict (perfect) a- implementation of R using relations only from 
r. It is not difficult to show that if R can be obtained from F by a series of strict (perfect) 
implementations, then it can also be obtained by a single strict (perfect) implementation (for 
the Boolean case, this is shown in [2^ Lemma 5.8]). 

The following lemma indicates the importance of strict implementations for Max CSP. It 
was first proved for the Boolean case, but without the assumption on bounded occurrences, 
in |20[ Lemma 5.17]. A proof of this lemma in our setting can be found in j23| Lemma 3.4] 
(the lemma is stated in a sHghtly different form but the proof estabHshes the required AP- 
reduction) . 

Lemma 2.9 IfT strictly implements a predicate f , then, for any integer k, there is an integer 
k' such that Max CSP(FU{/})-fc <ap Max CSP(F)-/c'. 

Lemma [2^ will be used as follows in our proofs of approximation hardness: if F' is a fixed 
finite collection of predicates each of which can be strictly implemented by F, then we can 
assume that F' C F. For example, if F contains a binary predicate /, then we can assume, at 
any time when it is convenient, that F also contains f'{x,y) = f{y,x), since this equality is a 
strict 1-implementation of /'. 

For proving hardness at gap location 1, we have the following lemma. 

Lemma 2.10 // a finite constraint language F perfectly implements a relation R and Max 
CSP(FU {i?})-/c has a hard gap at location 1, then Max CSP(F)-fc' has a hard gap at location 
1 for some integer k' . 

Proof: Let N be the minimum number of relations that are needed in a perfect implemen- 
tation of R using relations from F. 

Given an instance 1 — {V,C) of Max CSP(F U {R})-k, we construct an instance I' = 
{V',C') of Max CSP(F)-fc' (where k' will be specified below) as follows: we use the set V" 
to store auxiliary variables during the reduction so we initially let V" be the empty set. For 
a constraint c = (Q, s) g C, there are two cases to consider: 

1. \{ Q ^ R, then add N copies of c to C . 

2. \{ Q = R, then add the implementation of R to C where any auxiliary variables in the 
implementation are replaced with fresh variables which are added to V" . 

Finally, let V — V U V" . It is clear that there exists an integer k' , independent of 1, such 
that T is an instance of Max CSP(F')-A:'. 

If all constraints are simultaneously satisfiable in I, then all constraints in X' are also 
simultaneously satisfiable. On the other hand, if OPt(Z) < e|C| then 

opt(J') < eiV|C| + (1 - e){N - 1)\C\ 
^(e+(l-£)(l-l/iV))|C'|. 

The inequality holds because each constraint in X introduces a group of N constraints in X' 
and, as OPt(X) < e|C|, at most e\C\ such groups are completely satisfied. In all other groups 
(there are (1 — e)\C\ such groups) at least one constraint is not satisfied. We conclude that 
Max CSP(F)-A;' has a hard gap at location 1. □ 

An important concept is that of a core. To define cores formally we need retractions. A 
retraction of a constraint language T C Rjy is a function it : D ^ D such that if D' is the image 
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of TT then 7r(a;) = x for all x G D', furthermore for every i? G F we have (7r(ti), . . . , 7r(f„)) e R 
for all (ti, . . . , i„) G R. We will say that F is a core if the only retraction of F is the identity 
function. Given a relation R G i?^^ and a subset X of we define the restriction of R onto X 
as follows: R\^ = {x € X'' \ x G R}. For a set of relations F we define F|^ — {R\j^ \ R G F}. 
If TT is a retraction of F with image D' , chosen such that \D'\ is minimal, then a core of F is 
the set F|^,. For constraint language F,F' we say that F retracts to T' if there is a retraction 
TT of F such that 7r(F) = F'. 

The intuition here is that if F is not a core, then it has a non-injective retraction tt, which 
implies that, for every assignment s, there is another assignment its that satisfies all constraints 
satisfied by s and uses only a restricted set of values. Consequently the problem is equivalent 
to a problem over this smaller set. As in the case of graphs, all cores of F are isomorphic, so 
one can speak about the core of F. 

Example: Every constraint language F containing all unary relations is a core because the 
only retraction of the set of unary relations is the identity operation. 

The following simple lemma connects cores with non-approximability. 

Lemma 2.11 // F' is the core of T, then, for any k, Max CSP(F')-fc has a hard gap at 
location 1 if and only if Max CSP(F)-fc has a hard gap at location 1. 

Proof: Let tt be the retraction of F such that F' = {t^{R) \ R G F}, where 7r(i?) = {7r(t) | t G 
R}. Given an instance I = {V,C) of Max CSP(F)-fc, we construct an instance I' = {V,C') 
of Max CSP(F')-A: by replacing each constraint (i?, s) G C by (7r(i?), s). 

From a solution s to Z', we construct a solution s' to I' such that s'{x) — 7r(s(a;)). Let 
(-R, s) G C be a constraint which is satisfied by s. Then, there is a tuple a; G i? such that 
s(s) = a; so tt{x) G 7r(_R) and s'{s) = n{s{s)) = n{x) G n{R). Conversely, if {it{R),s) is a 
constraint in J' which is satisfied by s', then there is a tuple x £ R such that s'{s) = 7r(s(s)) = 
'k{x) G 7r(i?), and s{s) = x e R. We conclude that m{X, s) — m{l' , s'). 

It is not hard to see that we can do this reduction in the other way too, i.e., given an 
instance T = {V',C') of Max CSP(F')-A:, we construct an instance I of Max CSP(F)-A: 
by replacing each constraint (7r(i?),s) G C by {R,s). By the same argument as above, this 
direction of the equivalence follows, and we conclude that the lemma is valid. □ 

An analogous result holds for the CSP problem, i.e., if F' is the core of F, then CSP(F) is 
in P (NP-complete) if and only if CSP(F') is in P (NP-complete); see p3] for a proof. Cores 
play an important role in Section IH too. We have the following lemma: 

Lemma 2.12 (Lemma 2.11 in t36j) // F' is the core of T, then Max CSP(F')-B <ap 
Max CSP(F)-S. 

The lemma is stated in a slightly different form in [36] but the proof establishes the required 
AP-reduction. 

3 Result A: Hardness at Gap Location 1 for Max CSP 

In this section, we will prove Result A which we state as Theorem 13.61 The proof makes use 
of some concepts from universal algebra and we present the relevant definitions and results in 
Section 13.11 and Section 13.21 The proof is contained in Section 13.31 
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3.1 Definitions and Results from Universal Algebra 

We will now present the definitions and basic results we need from universal algebra. For a 
more thorough treatment of universal algebra in general we refer the reader to [HI [T^ . The 
article pt2j contains a presentation of the relationship between universal algebra and constraint 
satisfaction problems. 

An operation on a finite set D is an arbitrary function / : D'' D. Any operation on 
D can be extended in a standard way to an operation on tuples over D, as follows: let / 
be a fc-ary operation on D. For any collection of k n-tuples, ti,t2, • ■ • ,tfc G f", the n-tuple 
/(ti,t2, . . . ,tk) is defined as follows: 



where tj [i] is the z-th component in tuple tj . If f{d,d, ... ,d) = d for all d & D, then / is said 
to be idempotent. An operation / : D'' — + D which satisfies f{xi,X2,-.-,Xk) = Xi, for some i, 
is called a projection. 

Let i? be a relation in the constraint language F. If / is an operation such that for all 
ti,t2, ■ ■ ■ ,tk G we have /(<i,t2, • ■ • ,*fc) G R, then R is said to be invariant (or, in other 
words, closed) under /. If all constraint relations in F are invariant under /, then F is said to 
be invariant under /. An operation / such that F is invariant under / is called a polymorphism 
of F. The set of all polymorphisms of F is denoted Pol(F). Given a set of operations F, the 
set of all relations that is invariant under all the operations in F is denoted Inv(i^). 

Example: Let D = {0,1,2} and let R be the directed cycle on D, i.e., R ~ {(0,1), 
(1,2), (2,0)}. One polymorphism of R is the operation / : {0,1,2}'^ {0,1,2} defined as 
/(x, y, z) = X — y + z (mod 3). This can be verified by considering all possible combinations 
of three tuples from R and evaluating / component-wise. Let K be the complete graph on 
D. It is well known and not hard to check that if we view X as a binary relation, then all 
idempotent polymorphisms of K are projections. 

We continue by defining a closure operator (•) on sets of relations: for any set F C i?^), the 
set (F) consists of all relations that can be expressed using relations from F U {EQd} (where 
EQe) denotes the equality relation on D), conjunction, and existential quantification. Those 
are the relations definable by primitive positive formulae (pp-formulae) . As an example of a 
pp-formula consider the relations A = {(0, 0), (0, 1), (1, 0)} and B = {(1, 0), (0, 1), (1, 1)} over 
the Boolean domain {0, 1}. With those two relations we can construct / = {(0, 0), (0, 1), (1, 1)} 
with the pp-formula 



Note that pp-formulae and perfect implementations from Definition 12.81 are the same concept. 
Intuitively, constraints using relations from (F) are exactly those which can be simulated by 
constraints using relations from F in the CSP problem. Hence, for any finite subset F' of 
(F), CSP(F') is not harder than CSP(F). That is, if CSP(F') is NP-complete for some finite 
subset F' of (F), then CSP(F) is NP-complete. If CSP(F) is in P, then CSP(F') is in P for 
every finite subset F' of (F). We refer the reader to [34] for a further discussion on this topic. 

The sets of relations of the form (F) are referred to as relational clones, or co-clones. An 
alternative characterisation of relational clones is given in the following theorem. 



/ (tl , <2 , . . . , tfc ) = (/(tl [1] , t2 [1] , . . . , tfe [1] ) , /(<1 [2] , t2 [2] , 

fiti[nlt2[nl...,tk[n])), 



,tk[2]),... 




Theorem 3.1 ([51]) 



• For every set F C i?^,, (F) = Inv(Pol(F)). 
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• //r' c (r), then Poi(r) c Poi(r'). 

We will now define finite algebras and some related notions which we need later on. The 
three definitions below closely follow the presentation in p2]- 

Definition 3.2 (Finite algebra) A finite algebra is a pair A — {A; F) where A is a finite 
non-empty set and F is a set of finitary operations on A. 

We will only make use of finite algebras so we will write algebra instead of finite algebra. 
An algebra is said to be non-trivial if it has more than one element. 

Definition 3.3 (Homomorphism of algebras) Given two algebras A ~ {A; Fa) and B ~ 

{B;Fb) such that Fa = {ff' \ i e I}, Fb ^ {ff Me/} and both ff and ff are nt-ary for 
all i I , then ip : A ^ B is said to be an homomorphism from A to B if 

'P{ft{ai,a2, ...,«„,)) = /f (v?(ai), (^(03), . . . , <^(a„J) 

for all i € I and ai, a2, . . . , a„. G A. If ip is surjective, then B is a homomorphic image of A. 

Given a homomorphism Lp mapping A = {A; Fa) to B — {B;Fb), we can construct a 
equivalence relation 6 on A as 6 = {{x,y) \ (p{x) = fiy)}- The relation 9 is said to be 
a congruence relation of A. We can now construct the quotient algebra A/6 = {A/9; Fa/0). 
Here, A/9 = {x/0 \ x & A} and x/9 is the equivalence class containing x. Furthermore, Fa/9 = 
{f/9 \ f GFa} and f/9 is defined such that f /9{xi/9,X2/9, . . . , Xn/9) = f{xi,X2,...,Xn)/9. 

For an operation / : 13" D and a subset X C D we define /|^ as the function 
g : X" — > D such that g{x) = f{x) for all x E X". For a set of operations F on Z3 we 
define F\^ = {f\^ \ f e F}. 

Definition 3.4 (Subalgebra) Let A — (A; Fa) be an algebra and B C A. If for each f € Fa 
and any 5i, 62, ■ ■ ■ , &n G -B, we have f{bi, 62, ... , 6n) G B, then B = {B; Fa\q) is a subalgebra 
of A. 

The operations in Pol(Inv(fA)) are the term operations of A. If all term operations are 
surjective, then the algebra is said to be surjective. Note that Inv(Fyi) is a core if and only 
if A is surjective OIBlj. If F consist of all the idempotent term operations of A, then the 
algebra {A; F) is called the full idempotent reduct of A, and we will denote this algebra by 
A"^. Given a set of relations T over the domain D we say that the algebra = {D', Pol(r)) is 
associated with T. An algebra B is said to be a factor of the algebra ^ if S is a homomorphic 
image of a subalgebra of A. A non-trivial factor is an algebra which is not trivial, i.e., it has 
at least two elements. 

3.2 Constraint Satisfaction and Algebra 

We continue by describing some connections between constraint satisfaction problems and 
universal algebra. We will also formally state Result A in Theorem 13.61 The following 
theorem concerns the hardness of CSP for certain constraint languages. 

Theorem 3.5 ( [12| ) LetT be a core constraint language. If A^ has a non-trivial factor whose 
term operations are only projections, then CSP(r) is NP-hard. 

It has been conjectured [12] that, for all other core languages F, the problem CSP(r) is 
tractable, and this conjecture has been verified in many important cases (see, e.g., [ZlfO]). 

The first main result of this report is the following theorem which states that Max CSP(F)- 
B has a hard gap at location 1 whenever the condition which makes CSP(r) hard in Theo- 
rem [312] is satisfied. 
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Theorem 3.6 LetT be a core constraint language. If has a non-trivial factor whose term 
operations are only projections, then Max CSP(r)-i3 has a hard gap at location 1. 

The proof of this result can be found in Section 13.31 Note that if the above conjecture is 
true then Theorem 13 .61 describes all constraint languages F for which Max CSP(r) has a hard 
gap at location 1 because, obviously, F cannot have this property when CSP(r) is tractable. 

There is another characterisation of the algebras in Theorem 13.51 which corresponds to 
tractable constraint languages. To state the characterisation we need the following definition. 

Definition 3.7 (Weak Near- Unanimity Function) An operation f : D" D, where 
n >2, is a weak near-unanimity function if f is idempotent and 

f{x, y,y,...,y) = f{y, x,y,y,...,y)^...^ f{y, ...,y,x) 

for all x,y € D. 

Hereafter we will use the acronym wnuf for weak near-unanimity functions. We say that 
an algebra A admits a wnuf if there is a wnuf among the term operations of A. We also say 
that a constraint language F admits a wnuf if there is a wnuf among the polymorphisms of F. 
By combining a theorem proven by Maroti and McKenzie [48l Theorem 1.1] with a result by 
Bulatov and Jeavons [TT| Proposition 4.14], we get the following: 

Theorem 3.8 Let A be an idempotent algebra. The following are equivalent: 

• There is a non-trivial factor B of A such that B only have projections as term operations. 

• The algebra A does not admit any wnuf. 

3.3 Proof of Result A 

We will now prove Theorem [Ql Let SSATq denote the relation {0, 1}^ \ {(0,0,0)}. We also 
introduce three slight variations of SSATq, let SSATi = {0, 1}^ \ {(1,0, 0)}, SSATa = {0, l}^ \ 
{(1, 1,0)}, and SSATs = {0, 1}^\{(1, 1, 1)}. To simplify the notation we let Tssat = {SSATa, 
3SATi, 3SAT2, 3SAT3}. It is not hard to see that the problem Max CSP(r35^T) is precisely 
Max 3Sat. It is well-known that this problem, even when restricted to instances in which 
each variable occurs at most a constant number of times, has a hard gap at location 1, see 
e.g., [Ml Theorem 7]. We state this as a lemma. 

Lemma 3.9 ([55]) Max CSP(r35yiT)--S has a hard gap at location 1. 
To prove Theorem 13.61 we will utilise expander graphs. 

Definition 3.10 (Expander graph) A d-regular graph G is an expander graph if, for any 
S C F[G], the number of edges between S and V[G] \ S is at least mindS"], \V[G] \ S\). 

Expander graphs are frequently used for proving properties of Max CSP, cf. |2Hl49]. Typ- 
ically, they are used for bounding the number of variable occurrences. A concrete construction 
of expander graphs has been provided by Lubotzky et al. [i6] . 

Theorem 3.11 A polynomial-time algorithm T and a fixed integer N exist such that, for any 
k > N , T{k) produces a 14-regular expander graph with fc(l + o(l)) vertices. 



12 



There are four basic ingredients in the proof of Theorem l3.6l The first three are Lemma [2.101 
Lemma [3^ and the use of expander graphs to bound the number of variable occurrences. We 
also use an alternative characterisation (Lemma I3.12p of constraint languages satisfying the 
conditions of the theorem. This is a slight modification of a part of the proof of Proposition 
7.9 in [l^. The implication below is in fact an equivalence and we refer the reader to [12] for 
the details. Given a function f : D ^ D, and a relation R d Rd, the full preimage of R under 
/, denoted by f~^{R), is the relation {x \ f(x) G R] (as usual, f{x) denotes that / should 
be applied componentwise to x). 

Lemma 3.12 LetV he a core constraint language. If the algebra has a non-trivial factor 
whose term operations are only projections, then there is a subset B of D and a surjective map- 
ping ip : B {0, 1} such that the relational clone (T\JCd) contains the relations lp^'^{?>SATq), 
ip-'^{iSATi), ip-^{'iSAT2), andip-^{iSAT:i)}. 

Proof: Let AJ be the subalgebra of A such that there is a homomorphism Lp from AJ to 
an algebra B whose term operations are only projections. We can assume, without loss of 
generality, that the set {0, 1} is contained in the universe of B. It is easy to see that any 
relation is invariant under any projections. Since B only have projections as term operations, 
the four relations 2>SATq, ZSATx, 'iSAT2 and iSATj, are invariant under the term operations 
of B. It is not hard to check (see ^2j) that the full preimages of those relations under 
are invariant under the term operations of A! and therefore they are also invariant under the 
term operations of ^. By Theorem l3.H this implies {(p"^(3S'Aro), ip^^ {2iS ATi) , Lp^^(?tSAT2), 
p'\iSAT^)}C{VVJCD). □ 

We are now ready to present the proof of Theorem 13.61 Let S' be a permutation group on 
the set X . An orbit of S" is a subset Vt o{ X such that SI = {g{x) \ g € S} for some x £ X . 
Proof: Let At = (D;Pol(r)) be the algebra associated with F. For any a £ D, we denote 
the unary constant relation containing only a by Ca, i.e., Ca = {(«)}■ Let Cd denote the set 
of all constant relations over D, that is, Cd = {ca \ a G D}. By Lemma [3321 there exists a 
subset (in fact, a subalgebra) B of D and a surjective mapping ip : B {0,1} such that the 
relational clone (FUCd) contains ip~^{T3sat) = W~^iR) \ R G T^sat}- By Lemma lS^ Max 
CSP(r3SAT)--B is hard at gap location 1, so, by Lemma [TTTl Max CSP {ip>'^{T3sat))-B is 
also hard at gap location 1 (because T^sat is the core of 'P~^{T3sat})- 

Since F is a core, its unary polymorphisms form a permutation group S on D. We can 
without loss of generality assume that D = {1, . . . It is known (see Proposition 1.3 of 
and not hard to check (using Theorem 13. ip that F can perfectly implement the following 
relation: Rs = {{g{l), . . . ,g{p)) \ g G S}. Then it can also perfectly implement the relations 
EQi for 1 < z < p where EQi is the restriction of the equality relation on D to the orbit in 5" 
which contains i. We have 

EQi{x, y) 321, . . . , Zj+i, . . . , Zp :i?s(-Zi, ■ • • , ^i-i, a;, z^+i, . . . , Zp)A 

Rs{zi, . . . ,Zi_i,y, Zi+i, . . .,Zp). 

For < i < 3, let Ri be the preimage of SSATi under (p. Since i?i G (F U Cd), we can 
show that there exists a {p + 3)-ary relation i?- in (F) such that 

Ri = {{x,y,z) I {l,2,...,p,x,y,z) G R'i}. 

Indeed, since Ri G (FUCd), Ri can be defined by a pp-formula Ri{x, y, z) 3t'0(t, x, y, z) 

(here t denotes a tuple of variables) where ■0 is a conjunction of atomic formulas involving 
predicates from F U Cd and variables from t and {x, y, z}. Note that, in ip, no predicate from 
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Gd is applied to one of {x, y, z} because these variables can take more than one value in Ri. 
We can without loss of generality assume that every predicate from Cd appears in ip exactly 
once. Indeed, if such a predicate appears more than once, then we can identify all variables 
to which it is applied, and if it does not appear at all then we can add a new variable to t 
and apply this predicate to it. Now assume without loss of generality that the predicate q, 
1 < i < p, is applied to the variable ti in i/j, and = ipi A ip2 where ipi = AiLi Ci(ii) and 
ip2 contains only predicates from T \Cd- Let t' be the list of variables obtained from t by 
removing ti, . . . ,tp. It now is easy to check that that the (p + 3)-ary relation R'^ defined by 
the pp-formula 3t''ip2{t,x,y, z) has the required property. 
Choose R'^ to be the minimal relation in (F) such that 

Ri = {{x,y,z) I (1,2, .. . ,p,x,y,z) e R'i}. 

We will show that 

= {(.g(l),.g(2),...,.g(p),5(x),g(y),.g(z)) |.gG 5, {x,y,z)&R,}. 

The set on the right-hand side of the above equality must be contained in i?J because ii^ is 
invariant under all operations in S. On the other hand, if a tuple b = . . . , 5p, rf, e, /) belongs 
to i?-, then there is a permutation g G S such that (6i, . . . , 6p) — (g(l), . . . , g{p)) (otherwise, 
the intersection of this relation with Rs x G (F) would give a smaller relation with the 
required property). Now note that the tuple (1, . . . ,p,g~^{d),g~^{e),g~^{f)) also belongs to 
R[ implying, by the choice of i?^, that {g~^{d), g~^{e), g~^{f)) G Ri. This completes the proof 
and the relation R'^ is as described above. 

To simplify the notation, let F' = {i?^ | < i < 3} U {EQi, EQp}. By Lemma [TTOl in 
order to prove the theorem, it suffices to show that Max CSP(r')-i? has a hard gap at location 
1. By Lemma \3.9\ there is an integer K such that Max CSP(r35^T)--fi^ has a hard gap at 
location 1. Choose K such that K > 14. By Lemma EHH Max CSP{ip~^(r3s at))-K has 
the same property. We will now ^P-reduce Max CSP{ip'^ {T3sat))-K to Max CSP(F')-5. 
Take an arbitrary instance 1 — {V,C) of Max CSP{ip^^(r3SAT))-K, and build an instance 
1' — {V, C) of Max CSP(r') as fofiows: introduce new variables ui, . . . , Up, and replace each 
constraint Ri{x, y, z) in I by i?^(wi, . . . , Mp, x, y, z). Note that every variable, except the u^'s, 
in X' appears at most K times. We will now use expander graphs to construct an instance I" 
of Max CSP(r') with a constant bound on the number of occurrences for each variables. 

Let q be the number of constraints in I and let q' — maxjiV, g}, where N is the constant in 
Theorem 13. Ill Let G = (W, E) be an expander graph (constructed in polynomial time by the 
algorithm T(g') in Theorem 13. lip such that W = {wi,W2, ■ ■ ■ , Wm} and m > q. The expander 
graph r(g') have g'(l + o(l)) vertices. Hence, there is a constant a such that T(g') has at 
most aq vertices. For each 1 < j < p, we introduce m fresh variables w{,W2, ■ . ■ , into X" . 
For each edge {wi,Wk} G E and I < j < p, introduce p copies of the constraint EQj{wj,wl.) 
into C" . Let Ci, C2, . . . , Cg be an enumeration of the constraints in C". Replace uj by in 
Ci for all 1 < * < g- Finally, let C* be the union of the (modified) constraints in C" and the 
equality constraints in C" . It is clear that each variable occurs in X" at most Kp times (recall 
that p = |D| is a constant). 

Clearly, a solution s to I satisfying all constraints can be extended to a solution to I", 
also satisfying all constraints, by setting s(w^) = j for all 1 < i < m and all 1 < j < p. 

On the other hand, if to(I, s) < e|C|, then let s' be an optimal solution toX". We will prove 
that there is a constant s' < 1 (which depends on s but not on X) such that m{X" , s') < e'\C*\. 

We first prove that, for each 1 < j < _p, we can assume that all variables in = 
{wl ,W2, ■ ■ ■ , } have been assigned the same value by s' and that all constraints in C" are 
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satisfied by s'. We show that given a solution s' to X", we can construct another solution S2 
such that 171(2" , S2) > m{X" , s') and S2 satisfies all constraints in C" . 

Let be the value that at least m/p of the variables in have been assigned by s'. We 
construct the solution S2 as follows: S2(wf ) = for all i and j, and 52(2;) = s'(a;) for all other 
variables. 

If there is some j such that X = {x ^ \ s'{x) ^ a^} is non-empty, then, since G 
is an expander graph, there are at least p ■ min(|X|, \W^ \ X\) constraints in C" which are 
not satisfied by s' . Note that by the choice of X, we have \W^ \ X| > m/p which implies 
p ■ min( \X\,\ \X\) > \X\. By changing the value of the variables in X , we will make at most 
\X\ non-equality constraints in C* unsatisfied because each of the variables in occurs in at 
most one non-equality constraint in C* . In other words, when the value of the variables in X 
are changed we gain at least |^| in the measure as some of the equality constraints in C" will 
become satisfied, furthermore we lose at most \X\ by making at most \X\ constraints in C* 
unsatisfied. We conclude that m{X',S2) > m{l',s'). Thus, we may assume that all equality 
constraints in C" are satisfied by s' . 

Since the expander graph G is 14-regular and has at most aq vertices, it has at most ^aq 
edges. Hence, the number of equality constraints in C" is at most 7aqp, and |C"|/|C"| < lap. 
We can now bound m{X" , S2) as follows: 

mii",.s2) < oPT(i') + \c"\ < igli^dc'i + \c"\) < 'y^Uc'I + \c"\). 

Since \C*\ = |C"| + |C"|, it remains to set e' = f±^. □ 

We finish this section by using Theorem l3.6l to answer, at least partially, two open questions. 
The first one concerns the complexity of CSP(r)-i3. In particular, the following conjecture 
has been made by Feder et al. [27|. 

Conjecture 3.13 For any fixed T such that CSP(r) is 'NP-complete there is an integer k 
such that CSP(r)-fc is -complete. 

Under the assumption that the CSP conjecture (that all problems CSP(r) not covered by 
Theorem l3.5l are tractable) holds, an affirmative answer follows immediately from Theorem l3.6l 
So for all constraint languages T such that CSP(r) is currently known to be NP-complete it 
is also the case that CSP(r)-i? is NP-complete. 

The second result concerns the approximability of equations over non-abelian groups. Pe- 
trank [50] has noted that hardness at gap location 1 impHes the following: suppose that we 
restrict ourselves to instances of Max CSP(r) such that there exist solutions that satisfy all 
constraints, i.e. we concentrate on satisfiable instances. Then, there exists a constant c (de- 
pending on r) such that no polynomial-time algorithm can approximate this problem within 
c (unless P = NP). We get the following result for satisfiable instances: 

Corollary 3.14 Let T be a core constraint language and let A be the algebra associated with 
r. Assume there is a factor B of A'^ such that B only have projections as term operations. 
Then, there exists a constant c such that Max CSP(r)-i3 restricted to satisfiable instances 
cannot be approximated within c in polynomial time (unless P = NPJ. 

We will now use this observation for studying a problem concerning groups. Let Q = (G, •) 
denote a finite group with identity element Iq- An equation over a set of variables V is an 
expression of the form wi ■ . . . ■ Wk = 1g, where Wi (for 1 < i < k) is either a variable, 
an inverted variable, or a group constant. Engebretsen et al. [26] have studied the following 
problem: 
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Definition 3.15 (EQg) The computational problem Eq^ (where Q is a finite group) is defined 
to he the optimisation problem with 

Instance: A set of variables V and a collection of equations E over V . 
Solution: An assignment s : V ^ G to the variables. 
Measure: Number of equations in E which are satisfied by s. 

The problem Eq^[3] is the same as EQg except for the additional restrictions that each 
equation contains exactly three variables and no equation contains the same variable more 
than once. Their main result was the following inapproximability result: 

Theorem 3.16 (Theorem 1 in [26j) For any finite group Q and constant e > 0, it is NP- 

hard to approximate EQg[3] within \G\ — e. 

Engebretsen et al. left the approximability of Eq^[3] for satisfiable instances as an open 
question. We will give a partial answer to the approximability of satisfiable instances of EQg;. 

It is not hard to see that for any integer k, the equations with at most k variables over a 
finite group can be viewed as a constraint language. For a group Q, we denote the constraint 
language which corresponds to equations with at most three variables by Tg. Hence, for any 
finite group Q, the problem Max CSP(rg) is no harder than EQg. 

Goldmann and Russell [29] have shown that CSP(rg) is NP-hard for every finite non- 
abehan group Q. This result was extended to more general algebras by Larose and Zadori [ii] . 
They also showed that for any non-abelian group G, the algebra A = (G; Pol(rg)) has a non- 
trivial factor B such that B only have projections as term operations. We now combine Larose 
and Zadori's result with Theorem 13.61 

Corollary 3.17 For any finite non-abelian group Q, EQg has a hard gap at location 1. 

Thus, there is a constant c such that no polynomial-time algorithm can approximate satisfi- 
able instances of EQg better than c, unless P = NP. There also exists a constant k (depending 
on the group Q) such that the result holds for instances with variable occurrence bounded by 
A;. 

4 Result B: Approximability of Single Relation Max CSP 

In this section, we will prove the following theorem: 

Theorem 4.1 Let R G i?^-* be non-empty. If {d,... ,d) e R for some d G D, then Max 
CSP({-R}) is solvable in linear time. Otherwise, Max CSP({-R})-i? is hard to approximate. 

The proof makes crucial use of Theorem 13.61 and it can be divided into a number of steps: 

1. LemmaHUltogether with Lemma HTZl proves that directed cycles are hard to approximate 
(i.e., the theorem holds when R is the edge relation of a directed cycle). 

2. Vertex-transitive digraphs which are not directed cycles are proved to be hard to ap- 
proximate in Lemma l4.6l 

3. Lemma 14.101 give approximation hardness for bipartite digraphs. 

4. Lemma [4. 151 reduces the non- vertex transitive case to the vertex-transitive case. 



16 



5. Lemma [4 . 1 71 reduces general relations to binary relations, i.e., to digraphs. 

6. Finally, Theorem 14. II is proved by assembling the results from the previous sections. 
As indicated by the list above the bulk of the work deals with binary relations. 

4.1 Approximability of Binary Relations 

In this section, we will prove that non-empty non- valid binary relations give rise to Max CSP 
problems which are hard to approximate. Subsection 14.1.11 deals with binary (not necessarily 
symmetric) relations having a transitive automorphism group, and Section 14.1.21 deals with 
general binary relations. 

Sometimes it will be convenient for us to view binary relations as digraphs. A digraph is a 
pair (V, E) such that ^ is a finite set and E C V x V . A graph is a digraph (V, E) such that 
for every pair {x,y) € E we also have iy,x) G E. Let R G Rd be a binary relation. As R is 
binary it can be viewed as a digraph G with vertex set V[G] = D and edge set E[G] — R. We 
will mix freely between those two notations. For example, we will sometimes write {x, y) & G 
with the intended meaning {x, y) G E[G\ = R. 

Let G be a digraph, R = E[G], and let Aut(G) denote the automorphism group of G. If 
Aut(G) is transitive (i.e., contains a single orbit), then we say that G is vertex-transitive. If 
D can be partitioned into two sets, A and B, such that for any x,y £ A (or x,y £ B) we have 
(x, y) ^ i?, then R (and G) is bipartite. The directed cycle of length n is the digraph G with 
vertex set V[G] = {0, 1, . . . , n - 1} and edge set E[G] = {{x, a; + 1) | a; G V"[G]}, where the 
addition is modulo n. Analogously, the undirected cycle of length n is the graph H with vertex 
setV[H] = {0, 1,.. .,n-l} and edge set = {(x,a;+l) | x G V[H]}U{ix+l,x) \ x G V[H]} 
(also in this case the additions are modulo n). The undirected path with two vertices will be 
denoted by P2- 

4.1.1 Vertex-transitive Digraphs 

We will now tackle non-bipartite vertex-transitive digraphs and prove that they give rise to 
Max CSP problems which are hard at gap location 1. To do this, we make use of the algebraic 
framework which we used and developed in Section[3l Recall that we denote the unary constant 
relations over a domain D by Cd, i.e., Go — {{{x)} \ x G D}. We will need certain hardness 
results in the forthcoming proofs. 

Theorem 4.2 ([8J) Let G be an undirected core graph and let Ag be the algebra associated 
with G. If G is not bipartite, then there is a factor of Aq which only have projections as term 
operations. 

Lemma 4.3 Let G be a vertex-transitive core digraph such that \V[G] \ = 3 or |V[G]| = 4. // 
G is not a directed cycle, then G does not admit a wnuf 

Proof: Let v and u be two vertices in a vertex-transitive core digraph. Note that the in- 
and out-degrees of u and v must coincide, and hence the in- and out-degrees of v must be the 
same. 

Having this in mind, it is easy to see that there are only two digraphs with three vertices 
satisfying the conditions in the lemma: the directed cycle and the complete graph on three 
vertices. Similarly, it is easy to see that there are three core digraphs on four vertices which 
are vertex-transitive: the directed cycle, the complete graph on four vertices and the digraph 
in Figure [TJ 
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Figure 1: The non-trivial case in Lemma 14731 

For complete graphs, the results follows from Theorems 14.21 and 13.81 Denote the di- 
graph in Figure [1] by G and consider the following perfect implementation (originally used 
by MacGiUivray gZl step 3 in Theorem 3.4]). 

H{x,y) 3u,v : G{x,u) AG(u,v) AG{v,u) AG{v,y) 

It is not hard to see that H is the complete graph on four vertices. Since H does not admit 
any wnuf, Theorem 13.11 implies that G does not admit a wnuf either. □ 

Lemma 4.4 Let G and H be two digraphs such that there is a retraction from G to H. If G 
admits a wnuf, then so does H. 

Proof: Let r be a retraction from G to H. If G admits a wnuf f{xi, . . . , Xn), then it is easy 
to check that H admits the wnuf r{f{xi, . . . , x„)). □ 

Lemma 4.5 If G is a vertex-transitive digraph that does not retract to a directed cycle, then 
G admits no wnuf. 

Proof: For the sake of contradiction, let G be a digraph with the minimum number of 
vertices such that G is vertex-transitive, does not retract to a directed cycle and G admits a 
wnuf. Furthermore, among all counterexamples with |T^[G]| vertices, let G be the one with 
the maximum number of edges. 

It is well known, and easy to show, that the core of a vertex-transitive digraph is also 
vertex-transitive. If G is not a core, then the core of G is vertex-transitive, admits a wnuf 
(Lemma 14. 4p . and does not retract to a directed cycle. Hence, if G is not a core, then the core 
of G is a smaller counterexample. We can therefore, without loss of generality, assume that 
G is a core. By Lemma we can assume that |^[G]| > 4. We need the latter assumption 
because this is assumed in the proof of Theorem 3.4 in [47j, which we use below. 

For a digraph H, let undir{H) be the digraph induced by the double edges of H. It is easy 
to see that undir(H) can be perfectly implemented from H as follows: 

undir{H){x,y) H{x,y) A H{y,x). (1) 

The proof of Theorem 3.4 in [47] shows that it is possible to perfectly implement a digraph 
H with G and Cvig] such that there is a retraction r from H to a digraph H' which is 
vertex-transitive and 

1. undir{II') is not bipartite and not valid, or 

2. < |^^[G]| and H' does not retract to a directed cycle, or 

3. \V[H']\ = \V[G]\ and \E[H']\ > \E[G]\ and H' does not retract to a directed cycle. 
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In fact, the proof in [47j uses constructions called "indicator" and "subindicator" to obtain 
H from G, but these constructions are well known to precisely correspond to certain perfect 
implementations (or pp-formulas, see [Sj for details). 

Note that, since any wnuf is idempotent, the constraint language {^^[G]} U Cv[g] admits a 
wnuf. Then, by Theorem 13.11 the digraph H admits a wnuf. Lemma FOl applied to H shows 
that H' admits a wnuf as well. Now we see that cases (2) and (3) are impossible, since H' 
would contradict the choice of G. Case (1) leads to a contradiction too because, by Lemmas 14. 21 
and 14. 4^ the core of undir{H'), which is a non-bipartite undirected graph, would also admit a 
wnuf which is impossible by Theorems 14.21 and 13.81 □ 

Corollary 4.6 Let H be a vertex-transitive core digraph which is not valid and not a directed 
cycle. Then, Max CSP{{H})-B has a hard gap at location 1. 

Proof: Immediately follows from Lemma l4?5l Theorem 13.81 and Theorem 13.61 □ 

The next lemmas help to deal with the remaining vertex-transitive graphs, i.e. those that 
retract to a directed cycle. 

Lemma 4.7 If G is the undirected path with two vertices P2, or an undirected cycle Ck, k > 2, 
then Max CSP({G})-i3 is hard to approximate. 

Proof: If G = P2, then the result follows from Example[TJ If G = Gfc and k is even, then the 
core of Ck is isomorphic to P2 and the result follows from Lemmas 12.121 12.61 combined with 
Example [H 

From now on, assume that G = Ck, k is odd, and fc > 3. We will show that we can strictly 
implement Nk, i.e., the inequality relation. We use the following strict implementation 

Nkizi, Zk-i) + [k - 3) ^ max Gfe(zi, Z2) + Gfc(z2, Z3) + . . . + 

22,23, ...,2:^-2 

Gfc(zfe-3, Zk-2) + Gfc(zfe-2, zfe-i). 

It is not hard to see that if zi ^ Zk^i, then all /c — 2 constraints on the right hand side can be 
satisfied. If zi = Zk-i, then A: — 3 constraints are satisfied by the assignment Zi = zi + i — 1, 
for all i such that 1 < i < A: — 1 (the addition and subtraction are modulo k). Furthermore, no 
assignment can satisfy all constraints. To see this, note that such an assignment would define 
a path zi, Z2, . . . , Zk-i in Ck with k ~2 edges and zi — Zjt-i- This is impossible since A: — 2 is 
odd and k — 2 < k . 

The lemma now follows from Lemmas 12.91 and 12.61 together with Example [TJ □ 

Lemma 4.8 IfG is a digraph such that {x,y) £ E[G] {y,x) ^ E[G], then Max CSP({-ff})- 
B <AP Max CSP {{G})-B , where H is the undirected graph obtained from G by replacing 
every edge in G by two edges in opposing directions in H . 

Proof: H{x, y) + [1 — 1) — G{x, y) + G{y, x) is a strict implementation of H and the result 
follows from Lemma [231 ^ 

Lemma 4.9 If G is a non-empty non-valid vertex-transitive digraph, then Max CSP ({G})-i? 
is hard to approximate. 

Proof: By Lemmas 12.121 and 12.61 it is enough to consider cores. For directed cycles, the 
results follows from Lemmas 14.71 and [T8| and, for all other digraphs, from Corollary 14.61 □ 
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4.1.2 General Digraphs 



The main lemma of this section is Lemma [4. 161 which proves our result for general digraphs. 
We begin by considering bipartite digraphs. 

Lemma 4.10 IfG is a bipartite digraph which is neither empty nor valid, then Max CSP({G})- 
B is hard to approximate. 

Proof: If there are two edges {y,x) G E[G], then the core of G is isomorphic to P2 

and the result follows from Lemmas 12.61 and 12.121 together with Example [TJ If no such pair 
of edges exist, then Lemmas 12.61 and reduce this case to the previous case where there are 
two edges (a;, y), (y, a;) e i?[G']. □ 

We will use a technique known as domain restriction [23] in the sequel. For a subset 
D' C D, let r|^, = {^1^, I i? e r and is non-empty}. The following lemma was proved 
in |23[ Lemma 3.5] (the lemma is stated in a slightly different form there, but the proof together 
with [6l Lemma 8.2] and Lemma [2^ implies the existence of the required AP- reduction). 

Lemma 4.11 Let D' <Z D and D' e T, then Max CSP(r|^,)-S <ap Max CSP(r)-B. 

Typically, we will let D' be an orbit in the automorphism group of a graph. We are 
now ready to present the three lemmas that are the building blocks of Lemma 14.151 Let 
G be a digraph. For a set A C V[G], we define A+ = {j \ e E[G],i £ A], and 

A- ^{i\{z,3)^E[Glj^A]. 

Lemma 4.12 If a constraint language T contains two unary predicates S, T such that SCiT — 
0, then r strictly implements S UT. 

Proof: Let U ~ S U T. Then U{x) + {1 — I) = S{x) + T{x) is a strict implementation of 
U{x). □ 



Lemma 4.13 Let H he a core digraph and Vl an orbit in Aut{H) . Then, H strictly implements 
fl'^ and il^ . 

Proof: Assume that H e Rd where D — {1,2, . . . ,p} and (without loss of generality) assume 
that 1 G ri. We construct a strict implementation of fl'^; the other case can be proved in a 
similar way. Consider the function 

g{zi,...,Zp)= ^ H{z„Zj). 

By combining the fact that 77 is a core with Theorem 1 in ^5], one sees that the following holds: 
g{zi, . . . ,Zp) = \E[H]\ if and only if the function {1 1-^ zi, . . . ,p 1— > Zp] is an automorphism of 
H . This also implies that a necessary condition for g{zi, . . . , Zp) = \E[H] \ is that zi is assigned 
some element in the orbit containing 1, i.e. the orbit Vl. We claim that il+ can be strictly 
implemented as follows: 

n'^{x) + (a - 1) = max(iJ(zi, x) + g{z)) 

z 

where z = {zi, zi,..., Zp) and a — \E[H]\ + 1. 

Assume first that x £ il+ and choose y £ ft such that H{y,x) = 1. Then, there exists an 
automorphism a such that cr(l) — y and H{zi,x) + g{z) 1 + by assigning variable 

Zi, 1 < i < p, the value cr{i). 
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If X ^ then there is no y G such that H{y,x) — 1. If the constraint H{zi,x) 
is to be satisfied, then zi must be chosen such that zi ^ fl. We have already observed 
that such an assignment cannot be extended to an automorphism of H and, consequently, 
H{zi,x) + g{z) < 1 + whenever zi ^ ft. However, the assignment Zi — i, 1 < i < p, 

makes H[zi,x) + g{z) = \E[H] \ since the identity function is an automorphism of H. □ 

Lemma 4.14 If H is a core digraph and Vt an orbit in Aut{H), then, for every k, there is a 
number k' such that Max CSP({i7|o})-fc <ap Max CSP({ff})-fc'. 

Proof: Let V[H] — {1,2, ... ,p} and arbitrarily choose one element d G fi. Let X = {V, C) 
be an arbitrary instance of Max CSP({i? |o})-fc and let V — {vi, . . . ,«„}. We construct an 
instance T = (V U C" U C) of Max CSP({i7})-A:' (k' will be specified below) as follows: 
for each variable Vi ^ V: 

1. Add fresh variables w},...,w^ to V' . For each (a, 6) G E[H], add k copies of the 
constraint H{w^,w\) to C . 

2. Identify the variables Vi and wf and remove Vi from V . 

It is clear that there exist an integer k' , independent of X', such that 2' is an instance of 
Max CSV{{H})-k'. 

Let s' be a solution to X' . For an arbitrary variable Vi G V, if there is some constraint in C 
which is not satisfied by s', then we can get another solution s" by modifying s' so that every 
constraint in C is satisfied (if H{wf,wf) is a constraint which is not satisfied by s' then set 
s"(w^) = a and s"{w\) = b). We will denote this polynomial-time algorithm by P' , so s" = 
P'{s'). The corresponding solution to X will be denoted by P{s'), so P{s'){vi) = P' {s'){wf). 

The algorithm P may make some of the constraints involving vi unsatisfied. However, 
the number of copies, k, of the constraints in C implies that m{X',s') < m{X' ,P'{s')). In 
particular, this means that any optimal solution to X' can be used to construct another optimal 
solution which satisfies all constraints in C . 

Hence, for each Vi G V, all constraints from step 1 are satisfied by s" = P'(s'). As H is 
a core, s" restricted to w}, . . . (for any Vi G V) induces an automorphism of H. Denote 
the automorphism by / : V[H] — > V[H] and note that / can be defined as f{x) = s"{wf). 
Furthermore, s"{wf) G O for all wf &V since d £ Q. 

To simplify the notation we let I = By a straightforward probabilistic argument 

we have OPt(T) > p-IC*!. Using this fact and the argument above we can bound the optimum 
of X' as follows: 

opt(X') < opt(J) + kl\V\ 

< opt{X) + 2kl\C\ 

< OPT (J) + 2fcp^OPT(J) 
= (1 + 2fcp2)0PT(J). 

From Lemma [221 we know that there exists a polynomial-time approximation algorithm A 
for Max CSP(i?|j^). Let us assume that ^ is a c-approximation algorithm, i.e., it produces 
solutions which are c-approximate in polynomial time. We construct the algorithm G in the 
v4P- reduction as follows: 

, _ r P{s') if mix, P{s')) > mix, A(X)), 
^[-L,s )~- <^ A(J) otherwise. 
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We see that OPT(X)/m(X, G(X, s')) < c. Let s' be a r-approximate solution to 2'. As 
m{X',s') < to(Z', P'(s')), we get that -P'(s') is a r-approximate solution to X' , too. Further- 
more, since P'{s') satisfies all constraints introduced in step 1, we have OPt(X')— m(X', P'(s')) = 
OPt(X) - m{I, Pis')). Let ;5 = 1 + 2kp'^ and note that 



OPT(I) m{I, P{s')) 0PT(I') - m(I', P'{s')) 



m{I,G{I,s')) m(I,G{I,s')) m{X,G{l,s')) 



< 



^ OPTiI')-m{I',P'is')) ^ 
- m{I,G{I,s')) 

^ ^ opT{r)-mir,p'is')) ^ 

~ opt(X) ~ 

^ ^ _^ OFT{I')-m{I',P'{s')) ^ 

~ opt(X') ~ 

^ OFTiI')-miJ',P'{s')) ^ 

^ m{I',P'{s')) 
< l + c/3(r-l). 



□ 



Lemma 4.15 Let H be a non-empty non-valid digraph such that 

• \V[H]\ > 2, 

• H is a core, and 

• H is not vertex-transitive. 

Then, either (a) Max CSP({-ff})-i? is hard to approximate, or (b) there exists a proper subset 
X of V such that \X\ > 2, H\j^ is non-empty, is non-valid and for every k there exists 

a k' such that Max CSP({iJ|^})-fc <ap Max CSP({ff})-fc'. 

Proof: We split the proof into three cases. 

Case 1: There exists an orbit ili C V[H] such that flf contains at least one orbit. 

If H\^^ is non-empty, then we get the result from Lemma B. 141 since fii C V[H] (we cannot 
have I ill I = 1 because then H would contain a loop). Assume that H\^_^ is empty. As H\^_^ 
is empty, we get that is a proper subset of If ^^1^^+ is non-empty, then we get the 

result from Lemmas l4. 13112.91 and 14.111 Hence, we assume that H\^+ is empty. 

Arbitrarily choose an orbit n2 C il^ and note that ilf n ^2 ^ ^ since is empty. 

If U C V[H], then we get the result from Lemmas 03t [TOj liA2l and KTH because 
^\n+un- ^® non-empty. Hence, we can assume without loss of generality that Q^[JQ2 — 
and since fl^ fl — 0, we have an partition of V[H] into the sets fl^ and Sl^. Using the 
same argument as for , we can assume that i^j^j- is empty. Therefore, ,^2 ^ partition 

of V[H] and H\^+,H\^^- are both empty. This impHes that H is bipartite and we get the 
result from Lemma r4.1ot 

Case 2: There exists an orbit fli C V[H] such that il^ contains at least one orbit. 

This case is analogous to the previous case. 
Case 3: For every orbit C ^[i?], neither 17+ nor 17 contains any orbits. 
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Pick any two orbits fli and fl2 (not necessarily distinct). Assume that there are x ^ ili 
and y € Q2 such that {x,y) G E[H]. Let z be an arbitrary vertex in ^2. Since f22 is an 
orbit of H, there is an automorphism p € Aut{H) such that p{y) — z, so {p{x),z) G E[H]. 
Furthermore, fii is an orbit of Aut(iJ) so p{x) G fii. Since z was chosen arbitrarily, we 
conclude that ^^2 Q ■ However, this contradicts our assumption that neither fi^ nor fij" 
contains any orbit. We conclude that for any pair fli, fl2 of orbits and any x G fli, y G il.2, 
we have {x,y) ^ E[G]. This implies that H is empty and Case 3 cannot occur. □ 

Lemma 4.16 Let H be a non-empty non-valid digraph. Then, Max CSP{{H})-B is hard to 
approximate. 

Proof: Due to Lemmas 12.121 and 12.61 we can assume that H is a core. If H is vertex- 
transitive, then the result follows from Lemma 14.91 If H is not vertex-transitive, then we 
can obtain, by Lemma I4.15[ a smaller graph G such that G has at least two vertices, G is 
non-empty, G is non-valid, and Max CSP(G)-B <ap Max CSP{H)-B. By repeatedly using 
Lemma [4.151 we will eventually obtain either a graph which is vertex-transitive graph or a 
proof of approximation hardness. In the former case the result follows from Lemma ITOl □ 



4.2 Main Result 

Armed with the previous lemmas, it is sufficient to provide an arity reduction argument 
(Lemma 14. 1 71 below) and assemble the various pieces to prove the main theorem. Lemma [4. 171 
was first proved in [36] but we repeat the proof here to make this report more self-contained. 

Lemma 4.17 If R is a non-empty non-valid relation of arity n > 2, then R strictly imple- 
ments a binary non-empty non-valid relation. 

Proof: We prove the lemma by induction on the arity of R. The result trivially holds for 
n — 2. Assume that the result holds for 71 = fc, fc > 2. We show that it holds for n = fc + 1. 
Assume first that there exists {ai , . . . , at+i) G Z)*^"*"^ such that R{ai, . . . , ak+i) = 1 and 
|{ai, . . . , afe+i}| < k. We assume without loss of generality that = a^+i and consider the 
predicate R'{xi, . . . ,Xk) — R{xi, . . . ,Xk,X}~). Note that this is a strict 1-implementation and 
i?'(d, . . . , d) = for all d & D. Furthermore, note that R' is non-empty since R'{ai, . . . , a^) = 
1. 

Assume now that |{ai, . . . , afc+i}| = fc + 1 whenever R{ai, . . . , ak+i) — 1. Consider the 
predicate R'{xi, . . . ,Xk) = maxj, R{xi, . . . , Xfc, y), and note that this is a strict 1-implementation, 
We see that R' {d, . . . ,d) — Q for all d G D (due to the condition above) and R' is non-empty 
since R is non-empty. □ 

We are finally able to state the proof of the main theorem of this section. Theorem 14. II 
Proof: Let i? be a relation in i?^'^ Clearly, Max CSP({-R}) can be solved in polynomial 
time if R is valid. If R is empty, then all solutions have the same measure. 

Otherwise, if R is non-empty and not valid, then we can, due to Lemma 14.171 strictly 
implement a binary relation R' with R such that R' is neither valid nor empty. Together with 
Lemma [4 . 1 61 and Lemma [231 we get the desired result. □ 

We will now give a simple example on how Theorem 14.11 can be used for studying the 
approximability of constraint languages. Consider the following observation: Let F be a 
constraint language, i? G F and an orbit in Aut(F). Then, R\^ is either d-valid for every 
d G or not d- valid for any d G 51. 
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Proposition 4.18 Let O = {fl \ is an orbit in Aut(r)} and let T be a constraint language 
such that OCT. If T contains a k-ary, k > 1, relation R that contains a tuple [ti, . . . , t}~) 
such that R is not ti-valid, for any 1 < i < k, then Max CSP(r) is hard to approximate. 

Proof: We can view the unary relation 

U = [J{n eO \ ti en for some I < i < k} 

as a member of F due to Lemma [4.121 Now, R\jj is a non-empty, non- valid relation and 
approximability hardness follows from Lemmas 12. 9[ 14. 11^ and Theorem 14.11 □ 

Corollary 4.19 Let T be a constraint language such that Aut(r) contains a single orbit. IfT 
contains a non-empty k-ary, k > I, relation R which is not d-valid for all d £ D, then Max 
CSP(r) is hard to approximate. Otherwise, Max CSP(r) is tractable. 

Proof: If a relation R with the properties described above exists, then Max CSP(r) is hard 
to approximate by Proposition 14.181 (note that R cannot be c?- valid for any d). Otherwise, 
every A:-ary, k > 1, relation G F is c?- valid for all d S Z?. If F contains a unary relation 
U such that U C D, then Aut(r) would contain at least two orbits which contradict our 
assumptions. It follows that Max CSP(F) is trivially solvable. □ 

Note that the constraint languages considered in Corollary 14. 191 may be seen as a general- 
isation of vertex-transitive graphs. 

4.3 Max CSP and Supermodularity 

In this section, we will prove two results whose proofs make use of Theorem l4.1l The first result 
(Proposition l4.25|) concerns the hardness of approximating Max CSP(r) for F which contains 
all at most binary relations which are 2-monotone (see Section 14.3.11 for a definition) on some 
partially ordered set which is not a lattice order. The other result, Theorem 14.271 states that 
Max CSP(r) is hard to approximate if F contains all at most binary supermodular predicates 
on some lattice and in addition contains at least one predicate which is not supermodular on 
the lattice. 

These results strengthens earlier published results [HI HI] in various ways (e.g., they apply 
to a larger class of constraint languages or they give approximation hardness instead of NP- 
hardness) . In Section 14.3.11 we give a few preliminaries which are needed in this section while 
the new results are contained in Section [4.3.21 

4.3.1 Preliminaries 

Recall that a partial order C on a domain D is a lattice order if, for every x,y € D, there exist 
a greatest lower bound x Fly and a least upper bound xUy. The algebra C = {D; U) is a 
lattice, and xUy — y <=> xFly ~ x x \C y. We will write x \Z y if x ^ y and x \C y. All 

lattices we consider will be finite, and we will simply refer to these algebras as lattices instead 
of using the more appropriate term finite lattices. The direct product of C, denoted by is 
the lattice with domain D" and operations acting componentwise. 

Definition 4.20 (Supermodular function) Let C be a lattice on D. A function f : D" 
M is called supermodular on C if it satisfies, 

f(a) + f{b)<f{anb) + f{aub) (2) 

for alla. beD"^. 
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The set of all supermodular predicates on a lattice L will be denoted by Spmod£ and a 
constraint language F is said to be supermodular on a lattice £ if F C Spmod^. We will 
sometimes use an alternative way of characterising supermodularity: 

Theorem 4.21 ([24]) An n-ary function f is supermodular on a lattice C if and only if it 
satisfies inequality ^ for all a — (ai, 02, . . . , a„), b — (61, 62, ... , bn) S such that 

1. ai = bi with one exception, or 

2. ai = bi with two exceptions, and, for each i, the elements ai and bi are comparable in C. 
The following definition first occurred in p[6]. 

Definition 4.22 (Generalised 2-monotone) Given a poset V = {D, C), a predicate f is 
said to be generalised 2-monotone on V if 

f{x) = 1 ((xi, C a,J A . . . A {x,^ C at J) V {{xj^ □ 6j J A . . . A {xj^ □ b^J) 

where x — {xi,X2, ■ ■ ■ ,Xn) and ai^,. . . ,ai^,bj^, . . .,bj^ G D, and either of the two disjuncts 
may be empty. 

It is not hard to verify that generalised 2-monotone predicates on some lattice are su- 
permodular on the same lattice. For brevity, we will use the word 2-monotone instead of 
generalised 2-monotone. 

The following theorem follows from [231 Remark 4.7]. The proof in [23] uses the corre- 
sponding unbounded occurrence case as an essential stepping stone; see [20j for a proof of this 
latter result. 

Theorem 4.23 (Max CSP on a Boolean domain) Let D = {0, 1} andT C Rd be a core. 
IfV is not supermodular on any lattice on D, then Max CSP(r)-i3 is hard to approximate. 
Otherwise, Max CSP(F) is tractable. 

4.3.2 Results 

The following proposition is a combination of results proved in [16] and |41| . 
Proposition 4.24 

• // F consists of 2-monotone relations on a lattice, then Max CSP(F) can be solved in 
polynomial time. 

• Let V = {D, C) be a poset which is not a lattice. If F contains all at most binary 
2-monotone relations on V , then Max CSP(F) is NP-hard. 

We strengthen the second part of the above result as follows: 

Proposition 4.25 Let \— be a partial order, which is not a lattice order, on D. IfT contains 
all at most binary 2-monotone relations on then Max CSP(F)-i3 is hard to approximate. 

Proof: Since C is a non-lattice partial order, there exist two elements a,b € D such that 
either a □ 6 or a U 6 do not exist. We will give a proof for the first case and the other case can 
be handled analogously. 

Let g{x, y) = I <J=> (x ^ a) /\ {y ^ b). The predicate g is 2-monotone on "P so 5 £ F. We 
have two cases to consider: (a) a and b have no common lower bound, and (b) a and b have 
at least two maximal common lower bounds. In the first case g is not valid. To see this, note 



25 



that if there is an element c € D such that g{c, c) — 1, then c Q a and c Q b, and this means 
that c is a common lower bound for a and b, a contradiction. Hence, g is not valid, and the 
proposition follows from Theorem 14.11 

In case (b) we will use the domain restriction technique from Lemma 14.111 together with 
Theorem 14.11 In case (b), there exist two distinct elements c,d G D, such that c, d C a and 
c,d\—b. Furthermore, we can assume that there is no element z ^ D distinct from a, 6, c such 
that c C z C a, 6, and, similarly, we can assume there is no element z' £ D distinct from a, 6, d 
such that d C z' □ a, 6. 

Let f{x) = 1 <;=4> (x □ c) A (a; □ d). This predicate is 2-monotone on V. Note that there 
is no element z G D such that f{z) — 1 and g{z, z) = 1, but we have /(a) = f{b) = g{a, b) = 1. 
By restricting the domain io D' = {x e D \ f{x) = 1} with Lemma 14.111 the result follows 
from Theorem 14. II □ 

A diamond is a lattice £ on a domain D such that — 2 elements are pairwise incom- 
parable. That is, a diamond on \D\ elements consist of a top element, a bottom element and 
\D\ — 2 elements which are pairwise incomparable. The following result was proved in [32]. 

Theorem 4.26 Let T contain all at most binary 2-monotone predicates on some diamond C. 
IfV % Spmod^, then Max CSP(r) is NP-hard. 

By modifying the original proof of Theorem I4.26[ we can strengthen the result in three 
ways: our result applies to arbitrary lattices, we prove inapproximability results instead of 
NP-hardness, and we prove the result for bounded occurrence instances. 

Theorem 4.27 LetT contain all at most binary 2-monotone predicates on an arbitrary lattice 
^- If^ % Spmod^, then Max CSP(r)-i3 is hard to approximate. 

Proof: Let / G F be a predicate such that / ^ Spmod£. We will first prove that / can 
be assumed to be at most binary. By Theorem 14. 2H there is a unary or binary predicate 
/' ^ Spmod^ which can be obtained from / by substituting all but at most two variables 
by constants. We present the initial part of the proof with the assumption that /' is binary 
and the case when /' is unary can be dealt with in the same way. Denote the constants by 
as, a4, . . . , a„ and assume that f'{x, y) = f{x, y, 03, 04, . . . , a„). 

Let fc > 5 be an integer and assume that Max CSP(r U {/'})-fc is hard to approximate. 
We will prove that Max CSP(r)-fc is hard to approximate by exhibiting an AP-reduction 
from Max CSP(r U {/'})-fc to Max CSP(F)-A:. Given an instance I = {V,C) of Max 
CSP(ru {/'})-fc, where C = {Ci, C2, . . . , CJ, we construct an instance I' = {V , C) of Max 
CSP(r)-fc as follows: 

1. for any constraint {f',v) — Cj G C, introduce the constraint (/,«') into C, where 
t)' = (ui, 1)2, 2/3, . . ■ , y^), and add the fresh variables y?^,yl, ... ,yi^ to V . Add two copies 
of the constraints yf Q ai and C yf for each i G {3, 4, . . . , n} to C". 

2. for other constraints, i.e., {g,v) G C where g ^ /', add {g,v) to C. 

It is clear that X' is an instance of Max CSP(F)-A:. If we are given a solution s' to 2', 
we can construct a new solution s" to T by letting s"{yf) = for all i,j and s"{x) = s'{x), 
otherwise. Denote this transformation by P, so s" = P{s'). It is not hard to see that 
m(J',P(s')) >m{T,s'). 

From Lemma l2.2l we know that there is a constant c and polynomial-time c- approximation 
algorithm A for Max CSP(F U {/'}). We construct the algorithm G in the ylP-reduction as 
follows: 

, _ r P{.s% if m(X,P(s')|^) > m(X,A(J)), 
' \ A(X) otherwise. 
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We see that OPT(J)/m(X, G{I, s')) < c. 

By Lemma[221 there is a constant c' such that for any instance T of Max CSP(r), we have 
OPt(X) > c'\C\. Furthermore, due to the construction of Z' and the fact that m(X' ,P{s')) > 
■m{X' , s'), we have 



opt(I') < opt(X) + 4(n - 2)\C\ 

4{n - 2) 



4fn — 2) 

< OPXm + ; ■ OPT (I) 



< OPT (J) 

Let s' be an r-approximate solution to 1' . As m{l',s') < m{I' , P{s')), we get that P{s') 
also is an r-approximate solution to X' . Furthermore, since P{s') satisfies all constraints 
introduced in step 1, we have OPt(I') - m(X' , P{s')) = opt(X) - m(X, P{s')\^). Let f3 = 
1 + 4(n - 2)/c' and note that 

OPt(X) _ 



m{X,G{X, s')) 

m{X,P{s%) oPT{X')~m{X',P{s')) 
m{X, G{X, s')) ^ m{X, G(I, s')) 



< 



^ OFTjX') -m{X\P{s')) ^ ^ ^ ^ OFTjX') -m{X\P{s')) ^ 



m{X,G{X,s')) - OPt(I) 

^ OPTiX') -m{X',P{s')) ^ OFT{X')~m{X',Pis')) ^ 

- ^ OPt(2:') - -I- P m{X',P{s')) 



< 1 + c/3(r - 1). 

We conclude that Max CSP(r)-fc is hard to approximate if Max CSP(r U {f'})-k is hard 
to approximate. 

We will now prove that Max CSP(r)-i? is hard to approximate under the assumption 
that / is at most binary. We say that the pair (a, b) witnesses the non-supermodularity of f 
if f{a) + f{b) ^ fianb) + fiaUb). 

Case 1: / is unary. As / is not supermodular on C, there exists elements a,b £ C such that 
(a, 6) witnesses the non-supermodularity of /. 

Note that a and b cannot be comparable because we would have {a U 6, a □ 6} = {a, 6}, and 
so /(a U 6) -I- /(a □ 6) = /(a) -I- f{b) contradicting the choice of (a, b). We can now assume, 
without loss of generality, that /(a) — 1. Let z^, = aHb and z* — aUb. Note that the two 
predicates u{x) = 1 <;=^ a; C z* and u'{x) = 1 <==^ z» □ a; are 2-monotone and, hence, 
contained in F. By using Lemma [4.111 it is therefore enough to prove approximation hardness 
for Max CSP(r|^,)-B, where = {x G £> | C a; □ z*}. 

Subcase la: /(a) = 1 and f{b) — 1. At least one of f{z*) — and /(z*) = must hold. 

Assume that f{z^) = 0, the other case can be handled in a similar way. Let g{x,y) = 
1 -4=> [{x E a) A C b)] and note that g is 2-monotone so g G F. 

Let d be an arbitrary element in D' such that g{d,d) — 1. From the definition of g we know 
that d C a,6 so C which implies that d = z,. Furthermore, we have g{a,b) = l,/(a) = 

= 1, and /(z,) = 0. Let D" ^ {x e D' \ f(x) = 1}. By applying Theorem O to 
we see that Max CSP(r|^„)-S is hard to approximate. Now Lemma [4 . 1 1 1 implies the result 
for Max CSP(r|^,)-B, and hence for Max CSP(r)-B. 

Subcase lb: /(a) = 1 and /(&) = 0. In this case, f{z*) = and /(z*) = holds. 
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Table 1: Possibilities for g. 



X 


y 


ti{x) 




9{x,y) 








ai 


b2 








1 





1 


ai 


a2 


1 


1 1 


1 


1 





bi 


b2 


1 


1 1 


1 


1 


1 


bi 


a2 


1 









If there exists d € D' such that 6 C d C z* and f{d) = 1, then we get /(a) = 1, f{d) = 1, 
aU d = z* and f{z*) = 0, so this case can be handled by Subcase la. Assume that such an 
element d does not exist. 

Let u{x) — 1 <=i' b \Z X. The predicate u is 2-monotone so u G F. Let h{x) = 
f]D'{x) + u\d'{x). By the observation above, this is a strict implementation. By Lemmas 12.91 
and l2.6( it is sufficient to prove the result for T' = T\d' U {h}. This can be done exactly as in 
the previous subcase, with D" — {x G D' \ h{x) — 1}. 

Case 2: / is binary. We now assume that Case 1 does not apply. By Theorem 14.211 there 
exist ai , 02 , 6i , 62 such that 

f{ai,a2) + fibiM) ^ /(aiU6i,a2U62) + /(ain6i,a2n62) (3) 

where ai, bi are comparable and 02, 62 are comparable. Note that we cannot have ai □ bi and 
0-2 E ^2, because then the right hand side of |[3]) is equal to /(6i,62) + f{ai,a2) which is a 
contradiction. Hence, we can without loss of generality assume that ai C bi and 62 E 02. 

As in Case 1, we will use Lemma [4. Ill to restrict our domain. In this case, we will consider 
the subdomain D' = {a; £ D | C x C z*} where = oi Fl 62 and z* = 02 U 61. As the two 
predicates Mz* (x) and (x), defined by Mz» (2;) = 1 x Q z* and Uz^(x) = 1 <^=> z* □ a;, 

are 2-monotone predicates and members of F, Lemma 14.111 tells us that it is sufficient to prove 
hardness for Max CSP(F')-B where F' = r|^,. 

We define the functions ti : {0, 1} {a;, bi}, i = 1, 2 as follows: 

• ti{0) — fli and ti(l) — bi; 

• hiO) = 62 and t2(l) = 02- 

Hence, ti(0) is the least element of and bi and U^l) is the greatest element of and bi. 

Our strategy will be to reduce a certain Boolean Max CSP problem to Max CSP(F')-i?. 
Define three Boolean predicates as follows: g{x,y) = f{ti{x),t2{y)), co{x) = 1 <=4> x = 
0, and ci(x) — 1 x = I. One can verify that Max CSP{{co,ci,g})-B is hard to 

approximate for each possible choice of g, by using Theorem 14.231 consult Table [T] for the 
different possibilities of g. 

The following 2-monotone predicates (on D') will be used in the reduction: 

h,{x,y) = l ^ [(a;Cz,)A(y □t,(0))] V[(z* Cx)A(t,(l) □2/)],i = l,2. 

The predicates /ii,ft.2 are 2-monotone so they belong to F'. We will also use the following 
predicates: 

• Ld{x) = 1 ■<==^ X \Z d, 

• Gd{x) = 1 <;===> d\— X, and 
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• Nd,d'{x) = 1 (x C d) V {d' □ x) 

for arbitrary d,d' G I?'. These predicates are 2-monotone. 

Let w be an integer such that Max CSP({g, cq, ci})-?« is hard to approximate; such an 
integer exists according to Theorem 14.231 Let X — {V,C), where V = {xi,X2, ■ ■ ■ ,Xn} and 
C = {Ci, . . . , Cm}, be an instance of Max CSP({(7, cq, ci})-w. We will construct an instance 
T of Max CSP(r')-w', where w' = 8w + 5, as follows: 

1. For every d £ C such that Ci — g{xj,Xk), introduce 

(a) two fresh variables and yl, 

(b) the constraint /(y], y^), 

(c) 2w + 1 copies of the constraints Lt^ (yj), Ga^ (y]), A^ai.bi (Vj), 

(d) 2w + 1 copies of the constraints La^iUk) ^ ^biiVk) i ^b2,a2{yk) , and 

(e) 2w + 1 copies of the constraints hi{xj,yj), h2{xk,yl)- 

2. for every Ci G C such that Ci — co(a;j), introduce the constraint Lz,{xj), and 

3. for every Ci d C such that Ci = ci{xj), introduce the constraint Gz*{xj). 

The intuition behind this construction is as follows: due to the bounded occurrence prop- 
erty and the quite large number of copies of the constraints in steps Ic, Id and le, all of those 
constraints will be satisfied in "good" solutions. The elements and 1 in the Boolean problem 
corresponds to and z*, respectively. This may be seen in the constraints introduced in steps 
2 and 3. The constraints introduced in step Ic essentially force the variables yj to be either 
ai or bi, and the constraints in step Id work in a similar way. The constraints in step le 
work as bijective mappings from the domains {ai,6i} and {02,62} to {z*,z*}. For example, 
hi{xj,yj) will set Xj to if y* is oi, otherwise if y* is bi, then xj will be set to z*. Finally, 
the constraint introduced in step lb corresponds to g{xj,Xk) in the original problem. 

It is clear that J' is an instance of Max CSP(r')-w'. Note that due to the bounded 
occurrence property of X', a solution which does not satisfy all constraints introduced in steps 
Ic, Id and le can be used to construct a new solution which satisfies those constraints and 
has a measure which is greater than or equal to the measure of the original solution. We will 
denote this transformation of solutions by P. 

Given a solution s' to X', we can construct a solution s — G(s') to X by, for every x G V, 
letting s{x) = if P{s'){x) — and s{x) — 1, otherwise. 

Let M be the number of constraints in C of type g. We have that, for an arbitrary 
solution s' to r, m(X',P(s')) = m{X,G{s')) + 8{2w + 1) • M > m{X',s'). Furthermore, 
OPt(X') = OPt(/) + 8(2u; + 1)M. 

Now, assume that OPt(X')/to(I', s') < e' . It follows that OPT(X')/m(X', P(s')) < e' and 

OPt(/) + 8(2w; + 1)M , 
m(/, G(s')) + 8(2u; + 1)M " ^ ^ 
OPt(/) < e'm{I, G(s')) + (e' - l)8(2u; + 1)M ^ 

opt(T) , 8{2w + l)Mie' - 1) 
m{X,G{s')) - ^ ^ m(X,G(s')) ' 

Furthermore, by standard arguments, we can assume that m{X,G{s')) > \C\/c, for some 
constant c. We get, 

-^^^<e' + 8(2z« + l)c(e'-l). 
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Hence, a polynomial time approximation algorithm for Max CSP(r')-w' with performance 
ratio e' can be used to obtain ^''-approximate solutions, where e" is given by e' + 8(2?ii+l)c(e' — 
1), for Max CSP({co, ci, in polynomial time. Note that e" tends to 1 as e' approaches 1. 

This impHes that Max CSP(r')-w' is hard to approximate because Max CSP({co,ci,g})-u; 
is hard to approximate. □ 



5 Conclusions and Future Work 

This report have two main results: the first one is that Max CSP(r) has a hard gap at 
location 1 whenever F satisfies a certain condition which makes CSP(r) NP-hard. This 
condition captures all constraint languages which are currently known to make CSP(r) NP- 
hard. This condition has also been conjectured to be the dividing line between tractable (in 
P) CSPs and NP-hard CSPs. The second result is that single relation Max CSP is hard to 
approximate except in a few cases where optimal solutions can be found trivially. 

It is possible to strengthen these results in a number of ways. The following possibilities 
applies to both of our results. 

We have paid no attention to the constant which we prove inapproximability for. That is, 
given a constraint language F, what is the smallest constant c such that Max CSP(F) is not 
approximable within c — £ for any e > in polynomial time? For some relations a lot of work 
has been done in this direction, cf. [6] for more details. 

We have a constant number of variable occurrences in our hardness results, but the constant 
is unspecified. For some problems, for example Max 2Sat, it is known that allowing only 
three variable occurrences still makes the problem hard to approximate (even APX-hard) [6]. 
This is also true for some other Max CSP problems such as Max Cut [T]. This leads to the 
questions: is Max CSP({i?})-3 hard to approximate for all non-valid non-empty Kl and, is 
it true that Max CSP(F)-3 has a hard gap at location 1 whenever Max CSP(F)-i3 has a 
hard gap at location 1? 

One of the main open problems is to classify Max CSP(F) for all constraint languages 
F, with respect to tractability of finding an optimal solution. The current results in this 
direction [161 [Ml O 02] seems to indicate that the concept of supermodularity is of central 
importance for the complexity of Max CSP. However, the problem is open on both ends 
— we do not know if supermodularity implies tractability and neither do we know if non- 
supermodularity implies non-tractability. Here "non-tractability" should be interpreted as "not 
in PO" under some suitable complexity-theoretic assumption, the questions of NP-hardness 
and approximation hardness are, of course, also open. 
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